Feature Engineering for Knowledge Base Construction

نویسندگان

  • Christopher Ré
  • Amir Abbas Sadeghian
  • Zifei Shan
  • Jaeho Shin
  • Feiran Wang
  • Sen Wu
  • Ce Zhang
چکیده

Knowledge base construction (KBC) is the process of populating a knowledge base, i.e., a relational database together with inference rules, with information extracted from documents and structured sources. KBC blurs the distinction between two traditional database problems, information extraction and information integration. For the last several years, our group has been building knowledge bases with scientific collaborators. Using our approach, we have built knowledge bases that have comparable and sometimes better quality than those constructed by human volunteers. In contrast to these knowledge bases, which took experts a decade or more human years to construct, many of our projects are constructed by a single graduate student. Our approach to KBC is based on joint probabilistic inference and learning, but we do not see inference as either a panacea or a magic bullet: inference is a tool that allows us to be systematic in how we construct, debug, and improve the quality of such systems. In addition, inference allows us to construct these systems in a more loosely coupled way than traditional approaches. To support this idea, we have built the DeepDive system, which has the design goal of letting the user “think about features— not algorithms.” We think of DeepDive as declarative in that one specifies what they want but not how to get it. We describe our approach with a focus on feature engineering, which we argue is an understudied problem relative to its importance to end-to-end quality.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Intelligent Knowledge Based System Approach for Optimization of Design and Manufacturing for Abrasive Water Jet Machining

A water jet machining is an industrial tool capable of cutting a wide variety of materials using a very high-pressure jet of water, or a mixture of water and an abrasive substance. This paper addresses the concept of the Intelligent knowledge base system (IKBS) for optimization of product design and manufacturing process for water jet machining in computer based concurrent engineering environme...

متن کامل

Intelligent Knowledge Based System Approach for Optimization of Design and Manufacturing for Abrasive Water Jet Machining

A water jet machining is an industrial tool capable of cutting a wide variety of materials using a very high-pressure jet of water, or a mixture of water and an abrasive substance. This paper addresses the concept of the Intelligent knowledge base system (IKBS) for optimization of product design and manufacturing process for water jet machining in computer based concurrent engineering environme...

متن کامل

Fault Detection of Bearings Using a Rule-based Classifier Ensemble and Genetic Algorithm

This paper proposes a reduct construction method based on discernibility matrix simplification. The method works with genetic algorithm. To identify potential problems and prevent complete failure of bearings, a new method based on rule-based classifier ensemble is presented. Genetic algorithm is used for feature reduction. The generated rules of the reducts are used to build the candidate base...

متن کامل

Domain Knowledge Engineering Based on Encyclopedias and the Web Text

Based on natural language text analysis, this paper intends to draw a basic framework for the construction of domain knowledge base. Using encyclopedia resources and text information resources on the Web, we focus on the method of constructing domain knowledge base through technologies in natural language text analysis and machine learning. Moreover, an open network platform will be developed, ...

متن کامل

The Construction of language Resource and Knowledge Base for Chinese Language Computing

With the urgent needs of enhancing the intelligence of the internet, Knowledge engineering is attracting high attention from both industry and academia. Different from the knowledge engineering (such as Dbpedia, Knowledge Graph, YAGO, etc.) based on structured knowledge resources, the task of textual knowledge engineering is to mine knowledge from unstructured natural language texts. One of the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Data Eng. Bull.

دوره 37  شماره 

صفحات  -

تاریخ انتشار 2014